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Abstract 



We use support theory, in particular the fretsaw extensions of Shklarski and Toledo |ST06a| , 
to design preconditioners for the stiffness matrices of 2-dimensional truss structures that are 
stiffly connected. Provided that all the lengths of the trusses are within constant factors 
of each other, that the angles at the corners of the triangles are bounded away from and 
7r, and that the elastic moduli and cross-sectional areas of all the truss elements are within 
constant factors of each other, our preconditioners allow us to solve linear equations in the 
stiffness matrices to accuracy e in time 0(n 5 / 4 (log 2 n loglogn) 3 / 4 log(l/e)). 

1 Preconditioning 

When solving a linear system in an n x n positive semidefmite matrix A, the running time of 
an iterative solver can often be sped up by supporting A with another matrix B, called a 
preconditioner. An effective preconditioner B has the properties that it is much easier to 
solve than A, and that A has a low condition number relative to B. 
We define here generalized eigenvalues and condition numbers: 

Definition 1.1. For positive semidefinite A, B, the maximum eigenvalue, minimum eigen- 
value, and condition number of A relative to B are defined respectively as 



where x _L null(S) means that x is orthogonal to the null space of S. 

Note that the standard condition number of A can be expressed as k(A) = k(A,I). 
The conjugate gradient method is an example of a linear solver that can be sped up using a 
preconditioner. The precise analysis of the running time can be found, for example, in |Axe85| : 
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k{A, B) = A 
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(A,B)/X min {A,B) 



Theorem 1.2 ( |Axe85| ). For positive semidefinite A,B, and vector b, let x satisfy Ax = b. 
Each iteration of the preconditioned conjugate gradient method multiplies one vector by A, solves 
one linear system in B, and performs a constant number of vector additions. For e > 0, it 
requires at most 0(-v/ k(A, B) log(l/e)) such iterations to produce a x that satisfies 



x 



x\\a < 4 x \\a 



1.1 Using a Larger Matrix 

In certain situations it may be easier to find a good preconditioner for a matrix A if we treat A 
as being larger than it really is. That is, if we pad A with zeros to form a larger square matrix 
\A Ol 

A' = , it may be simpler to find a good preconditioner B for A'. We then need to show 

how to use B to yield a preconditioner for the original matrix A. To this end, we define the 
Schur complement: 



Definition 1.3. For square matrices A and B 
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B n B 

Bf 2 B22 



, where square submatrix B\\ is 



the same size as A, and such that B22 is nonsingular, the Schur complement of B with respect 
to A is 

Bs = Bu - BuB^ B 12 

While Bs will not automatically be a good preconditioner for A simply because B is a good 
preconditioner for A', we do know that the maximum eigenvalue will be the same: 

Lemma 1.4. For positive semidefinite A, B , \ ma x (A',B) = X max (A,B s ). 

We also know that solving a linear system in Bs is as easy as solving a linear system in B: 

x b 

Lemma 1.5. B = ^ implies Bsx = b 

For completeness, we give proofs for these lemmas in Appendix lAl 
1.2 Congestion-Dilation 

Suppose that we have matrices A and B that can be expressed as the sums of other matrices, 
i.e. A = Ai and B = ■ Bj, and that we know how to support each A{ by a subset of the 
Bj matrices. In this situation, we can use the following lemma to show how B supports A: 

Lemma 1.6 (Congestion-Dilation Lemma). Given the symmetric positive semidefinite matrices 
Ai, A n , Bi, B m and A = ^2 i A{ and B = ^2jBj and given sets Ej C [l,...,m] and real 
values Si that satisfy 

^max(Ai, ^ ^ Bj) < Si 

it holds that 



\ max (A,B) < max I ^ 



A proof of this lemma is given in Appendix^] It is an adaptation of, for example, Proposition 
9.4 in [BH03]. The expression X^i jgs s * can De though of as the congestion of matrix Bj, 
analogous to the concept of congestion for graph embeddings which we define in Section |H 



2 Trusses and Stiffness Matrices 

Definition 2.1. A 2-dimensional truss T = (n, {vi}™ =1 , E, 7) is an undirected weighted pla- 
nar graph with vertices [n] = {l,...,n} and edges E, with vertex i € [n] embedded at point 
Vi £ K 2 . We allow multiple vertices to be embedded at the same point. 

An edge e = (i,j) £ E, also called a truss element, represents a straight idealized bar from 
Vi to Vj, with positive weight 7(e) denoting the product of the bar's cross-sectional area and the 
elastic modulus of its material. A truss face is a triple {i,j, k} such that (i, k), (j, k)} C 

E and no vertex is in the interior of the triangle formed by Vi,Vj,Vk- Every truss element is 
required to be contained in some truss face. 

There is a particular type of linear system that arises when analyzing the forces on a truss 
using the finite element method. We define here the type of matrix we wish to solve: 

Definition 2.2. Given a truss T = (n, {i>i}™ =1 , E, 7), for each truss element e = € 
E we define a length 2n column vector u e = [u\ ... u^. n ] T with 4 nonzero entries satisfying 
[tig* -1 ttg ! ] T = — Ue^] T = |^'_^| , and we define the 2n x 2n matrix 

a 7( e ) T 

A e — -j -u e ti e 

\Vi - Vj\ 

The stiffness matrix of the truss is then given by: 

A r = Y, A e 

Note that a stiffness matrix is positive semidefinite, since for all x we have 

x T A T x= Y , 7(6) x T u e u T e x= Y , 7(6) , (s r u e ) 2 >0 

^ \Vi-Vj\ ^— ' \Vi-Vj\ 

e=(vi,Vj)eE e=(vi,Vj)eE 

We would like to restrict our attention to trusses with a unique, well-behaved stress-free 
position. To this end, we make the following definitions: 

Definition 2.3. The rigidity graph Qq- of a truss T is the graph with vertex set given by the 
set of truss faces of T, and with edges connecting faces that share an edge. 

We say that a truss T is stiffly-connected if (1) Qq- is connected, and (2) for every i G [n], 
Qj- is connected, where Qj- is the graph induced by Qq- on the set of faces containing vertex i. 

The main contribution of this paper is an algorithm TrussSolver for solving linear systems 
in stiffness matrices of stiffly-connected trusses. We will describe the algorithm later, but we 
state here the result of our analysis of the running time: 

Theorem 2.4 (Main Result). For any stiffly- connected truss (n, {vi}^ =1 , E, 7) such that 
• all truss elements have lengths in the range [l m in^max\ 



Figure 1: The truss on the right is a fretsaw extension of the truss on the left, as given by the 
fretsaw algorithm. The vertex positions in the fretsaw extension are distorted slightly so as 
to be able to distinguish vertex copies in the same location. The subgraph F is shown as solid 
lines, while the rest of the trusses' connectivity graphs are shown as dotted lines. Note that the 
connectivity graph of the fretsaw extension has one edge not in F. 

• all angles of truss faces are in the range [O m i n ,Ti — Omin]- 

• all weights are in the range [ n f m in>'ymax\- 

for positive constants l m in,lmax^minjlmin>7max} TrussSolver solves linear systems in matrix 
At within relative error e in time O (ra 5 / 4 (log 2 raloglogn) 3//4 log(l/e)) 

2.1 Fretsaw Extension 

We will precondition the stiffness matrix using a fretsaw extension, a technique described in 
[ST06a]. The fretsaw extension of a truss is a new truss created by splitting some of the vertices 
into multiple copies, without changing the identity of the truss faces. 

Definition 2.5. Let T = (n, {vi}f =1 , E, 7} and T = (m,{v' i }'^L 1 ,E',j') be 2- dimensional 
trusses, with m > n. Let F and F' be the sets of truss faces of T and T respectively, Let 
p : F' — > F be a bijection between the faces, and let n : [m] — > [n] be a surjection on the vertices. 
T is the (p, 7r)-fretsaw extension of T if 

• for all i € [m], the ith vertex of T is a copy of the ir(i)th vertex ofT, i.e. v[ = v^n\ 

• for all faces f = (i,j,k) £ F' , the vertices of face f are copies of the vertices of p(f), i.e. 
p(i,3,k) = (vr(i),7r(j),vr(A;)) 

• every edge € E' has the same weight as the edge of which it is a copy, i.e. j'(i,j) = 
7(vr(i),vr(j)) 

Since each vertex in T has at least one copy in T , we follow that convention that Vi € [n] : 
ir(i) = i. Thus, for i £ [n], vertex i in T can be considered the "original copy" of vertex i in T . 

A fretsaw extension has the following property which makes it a useful preconditioner: 

Lemma 2.6 (see [ST06aJ Lemma 8.14). Let A,B be the stiffness matrices ofT,T' respectively. 
Let F>s be the Schur complement of B with respect to A. 

If T' is a fretsaw extension of T , then \ m i n (A, L>s) > 1/2 



We give a short proof of this lemma in Appendix Combining this with Lemma 11.41 we 
find: 



Corollary 2.7. Let A,B be the stiffness matrices ofT,T' respectively. Let B$ be the Schur 
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complement of B with respect to A, and let A' = be the same size as B. 

If 'T is a fretsaw extension of T , then k(A,Bs) < 2X max (A', B) 

Now, consider a truss T and fretsaw extension T\ with respective rigidity graphs Qq- and 
Qtl By construction, every pair of faces that share an edge in T must also share an edge in 
T. That is, if we let p(Qt') = pC/2)) : (/ij/2) G Qt 1 } denote the graph isomorphic to 

Qt' on the faces of T, then p(Qt') Q Qt- 

As it turns out, for any subgraph H C Qq- of our choice, we can construct a fretsaw extension 
with Qq-i (almost) isomorphic to H. We present a linear-time construction here. For technical 
reasons, this construction also takes as input a map r : [n] — > F that for each vertex in T specifies 
one face containing that vertex. The construction ensures that the face in T corresponding to 
r(i) contains the original copy of vertex i. This feature will be useful later, and does not diminish 
the generality of the algorithm. 

Lemma 2.8. There exists an linear-time algorithm (T',p) = f retsaw(T, H, r) that takes a 
stiffly- connected truss T = (re, {vi}f = ^,E, 7) with face set F, a connected spanning subgraph H 
°f Qt, and a map r : [n] — > F from each vertex to a truss face containing it, and returns a 
stiffly- connected (p, it) -fretsaw extension T = (m, {vi} r ^ 1 , E' satisfying: 

1. for all i € [re], i € p _1 (r(i)) 

2. H' C Q' 

3. if \H'\ = n — 1 + k (i.e. H' is a spanning tree plus k additional edges), then \Q' — H'\ < k 
where Q' denotes Qt 1 ■ and H' denotes p~ 1 (H) = {(/i,/2) € Q' : (p(/i), p(/a)) € H} 

Proof. Here is the construction, an example of which is given in Figure [TJ 



f retsaw(T, H, r) 

First, for each vertex i in T, we create the set 7r _1 (i) of copies of vertex i in T: 
(Recall that we call i G vr _1 (i) the "original copy".) 

• Let Fi denote the set of faces of T containing vertex i, and let Hi denote the graph induced 
by H on F{. For each connected component of Hi, we put one copy of vertex i in T . The 
original copy is assigned to the connected component of Hi containing face t{%). 

Now, for / E Fi, let <j>(i, f) G vr~ 1 (i) denote the copy of vertex i that is assigned to the component 
of Hi containing /. It is straighforward to construct the faces of T : 

• For each face / = k) of T, we create a face p~ l {f) = (4>(i, /), 4>(j, /), <f)(k, /)) in T . 



The first property is directly enforced by the construction. 

To see why the second property holds, consider an edge (/1, f-i) G H, where is the edge 
shared by faces f± and $2- Since edge (/i,/2) is present in both Hi and Hj, faces /0 _1 (/i) and 
p~ 1 {f2) will share the same copies of vertices i and j, and so they too will share an edge. 



As for the third property, suppose that H' has n — 1 + k edges, and thus divides the plane 
into k regions. It suffices to show that each such region contains at most one edge in Q' — H'. 

Let (/i, fy) be an edge in Qq-') — H'. Let be the edge shared by f\ and f2- Let Q' { and 
H[ f= Q[ denote the graphs induced on Fi by Q' and H' respectively. 

Since fi and fj share the vertices i and j, we know there must a path from f\ to fi both 
in H[ and in H'-. Of course neither path contains the edge (/i,/2), since it is not in H'. The 
only possibility then is that Q[ is a cycle H[ U /2)}j where fl] is a path from /j to /2, and 
similarly for Q£. Thus Q\ U Qj. = fl? U H'j U {(/i, / 2 )}, and so (/1, / 2 ) is the only edge of Q' 
inside the region enclosed by cycle H[ U H'-. □ 



3 Path Lemma 

We will need to construct a fretsaw extension with a truss matrix that can be solved quickly. In 
particular, the fretsaw extension we construct will have a connectivity graph that is close to a 
spanning tree (i.e. close to having re — 1 edges), because we can efficiently find a sparse Cholesky 
factorization of its truss matrix. The following result is proven in Appendix (?somewhere?): 

Lemma 3.1. Let A be the stiffness matrix of an n-vertex truss T, where Qj- comprises a 
spanning tree R plus a set S of additional edges. A Cholesky factorization A = PLL T P T can be 
found in time 0(n + \S\ 3 > 2 ), where P is a permutation matrix, and such that systems in lower 
triangular matrix L can be used to solve systems in A in time 0(n + \S\ log \S\). 

Now, of course we want to construct a fretsaw extension whose truss matrix provides good 
support for the original truss matrix. If we can give a supporting subset of faces in the fret- 
saw extension for each element in the original truss, then we can use Lemma 11.61 to bound 
the maximum generalized eigenvalue. To this end, we show that a simply-connected set of 
faces that connects two vertices supports the matrix of an element between the pair of vertices 
proportionally to the cube of the number of faces: 

Lemma 3.2 (Path Lemma). Let T = (n, {■Uj}^ =1 , E, 7) be a k-face simply- connected truss, and 
let eo be a truss element with weight 7(eo) between any pair v p , v q £ V of vertices in the truss, 
such that in T and eo 

• all truss elements have lengths in the range [l m i n , Imax] 

• all angles of truss faces are in the range [9 m i n ,TT — m i n ]. 

• all weights are in the range ["fmin^max]- 

for positive constants Imin^ Imax 1 ®mini 'Jmini ^) 'max • Then. 

Xmax(A eo ,A T ) = 0{k 3 ) 

We first note the simply-connected truss T must contain a simply-connected subset of faces 
whose rigidity graph is a path, such that the first face in the path is the only one containing 
vertex p and the last triangle is the only one containing vertex q. Thus, without loss of generality 
we may assume that T is itself such a "truss path" , because removing the extra faces can only 
increase the value of X m ax- 

Let us then number the faces in the path in order f\,...,f n , and let us number the n + 2 
vertices as follows: 



Figure 2: A truss path from vq to tin, with triangles and vertices labeled appropriately. 



• Let p = be the vertex in fi but not f 2 - 

• Let vertices 1 and 2 be the pair of vertices shared by fi and fi- 

• For 3 < i < n + 1, let i be the vertex in but not /j-2- (In particular, q = n + 1.) 
Furthermore, for 2 < i < n, consider the labeling of the three vertices in /j_x 

• i is one of the vertices shared by and /j. 

• Let denote the other vertex shared by and 

• Let a(i) denote the vertex in but not fi. 

For completeness, define s(l) = 0, s(n + 1) = n, a(n + 1) = n — 1, so for 1 < i < n: 

fi = {s(i),i, i + l} = {a(i + 1), s(i + 1), i + 1} 
and the set of truss elements is given by: 

n+l 



E = {(0,l)}u(J{(s(i),i),(a(i),i)} 



i=2 



An example of this labeling is given in Figure [2j 

In the proof we make use of the following canonical definition of a perpendicular vector: 



Definition 3.3. The counterclockwise perpendicular of a vector x 



x 2 



€ R 2 is 



X' 



-X 2 



We note some useful properties of the perpendicular: 
Claim 3.4. For any x,y, z € B 2 : 

1. (- x )^ = -( x ±) 

2. {x + y) 1 - = x ± + y ± 

3. \x T y- L \ = |a;||y|| sin#| where 9 is the 
1 



'erence in angle between x and y 



, (x-y) T {x-z) J 
4 ' (x-y)T(y- z y 



5. 



y' 1 ' x 1 - 



_L T 
y x 

x 1 r jp- 



1 
1 



Proof. The first three properties are trivial. 

Thp fourth ran pasilv hp sppti hv ( x -y) T ( x - z ) ± _ (x-y) T (x-y) 1 - (x-y) T (y-z) 1 - _ n , i 

ine iourtft can easily De seen by (a _ i;) T( y _ z )± - fx-^Tf,,-^! + rx-«iTr„_ z u - u + i 
Here is a proof of the fifth: 



(*-») r (w-*)- L ^ (^-2/) T (y-2) J 



_L T 

g y y ' x 

y T x ± 



T 



x T y- 
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X2Vl 


X2V2 


1 


X\V2 


X2V2 




"1 


0" 


X2Vi ~ xiy 2 


-xiyi 


~X\V2_ 


X2V\ ~ X\V2 


-xiyi 


~X2V\_ 







1 



□ 



Now, let x* = [xq,x*, . . . ,X2 n _iY be a vector that maximizes ^t^~ over all values of x. 

2f] 



Let x* denote [x^^x* 1 ; 



In particular we choose an x* such that (x\ — Xg) is parallel to (v\ — vo), by taking advantage 
of the following property of the null space: 

Lemma 3.5. Define x R = [x R , . . . ,a^_ 1 ] T to be the vector satisfying [x R i _ l ,x R i ] T = (vi — vq)- 1 . 
x R is in the null space of both A eo and Aq-. 

Proof. For the matrix A e of any single element e = (i,j), we have: 

AexR = \ Vi l Vj \ UeU e xR = \ Vi 1 Vj \3 Ue ((^ ~ v i) T ( v i ~ V i) ± + ( V i ~ V if( V 3 ~ V i) ± ) = 

□ 

Note that we can eliminate the component of x* perpendicular to (vi — vq) by adding the 
appropriate multiple of x R . 

Now, let us focus momentarily on a single vertex i, and the two elements (s(i), i) and (o~(i), i) 
that connect vertex i to lower numbered vertices. The terms x T A^ s ^^x and x T A(„(^ i\x are 



zero respectively when 
and 



{vi - v s(i) ) T ( 



Xi X 



.(»)) = o 



{Vi - v a(i) ) T ( Xi - x a(i) ) = 

ind x a (i} = x * a (iy we would li 
satisfies both of the above equations. 



Supposing we set x s {i) = X * S U) anc ^ x a(i) = x *a(i\-> we wom d like to define di to be the vector 
such that setting Xi = x* — di satisfies both of the 



In particular, we define the vectors 
di = 

where 



X ~^ X Q % — 1 

vl-Ku+MK^-Kd)) 2<z<n+l 
(«* - V*(i))H V i - v a(i) ) T 



1 (Vi - V a(i) ) T (Vi - v^) 1 - 
We claim that these satisfy the following properties: 
Lemma 3.6. The following are properties of the dis: 
1. For all G E, j < i: 

(vi - Vj) T (x* - x*) = (vi - Vj) T di 



2. For alii: 

\ rn * 



1 % 
* * \ <r~ i max i , 



mm vmin ■ , 



Proof of 1. The statement is trivial for element (0, 1) There are two other types of elements we 
must consider: (s(i),i) and (a(i),i). 
For an element (s(i),i) we have: 

, , T , , ,t, , . , , (»i-°»(i)) r (°i-»,(i)) 1 (''i-'' g (i)) T ( a! ;(i)-<(i)) 

(„,-„,(„) * = (.,-„«„) (*,-*«„) + { ^ — p 

= fa - v s{i) f(x* - x* w ) + 
using the fact that (vi — v s ^) T (vi — w.,^) 1 = 



□ 



For an element (a(i),i) we have: 

, , T , , ,t, * * («i - « J(i) ) r («i - ^(i)) 1 !^ - « g( i)) r (a!; (i) - 

{Vl -v c{l) ) d i = {v i -v^) (x l -x s[i)) + (vt-v^nvi-v.^ 

= (Vi - v a(i) ) T {x* - + ( Vi - v u{i) f{x* s{i) - x* a{i) ) 

= {vi - v a{i) ) T (x* - x* a(i) ) 

Proof of 2. For i > 2, using the fact that {s(i),a(i)} = {s(i — 1), i — 1}, we have 

x *i ~ x *s(i) = d i~ Ri( x l(i) ~ K(i)) = d i ± Ri&i-i ~ K(i-i)) 
Since x\ — Xq = di, we recursively find that 

i-l 

\ x * ~ x * s (i) \ < \d i \+Y,\{RiRi-i---Rj+i)d j \ 

j'=i 



The following finishes the proof: 
\RiRi~l ■ ■ ■ Rj+\dj\ 

(»* - «<T(fe)) r (»k-l - W,(jfe_i))- 



(»i «,(,)) JI^ _ U(T(fc) ) T (t, fc - u s(fc) ) J 



(vj+i - v a{ j +1) ) T dj 



I Oj+i - v a{j+1) ) T (v j+1 - v s(j+1) ) 

' fe ~+2 V ^ fc ~ V v(k)) T { V k - V s(k)) X J ~ V a(j+1) ) T ( Vj+1 - V a{j+1) )- 



by the fact that {s(i),a(i)} = {s(i — — 1} 



{vi-Vsty) 1 - TT (±i).— 

fc i^ 2 (^■+i-^( i +i ) ) T (^+i-'y s ( i+ i))- 



by Claim ETH 4 



< 



\Vj - V S (i)\\Vj+l ~ v a(j+l)\\ d j\ 

\vj+i - v a (j +1 )\\v j+1 - v s (j +1 )\smO min 
by Claim 03 



K - v s{i )\\dj\ 



Vj+i - v s {j+i) I sin O m ir, 

Imax i , i 
< : : rfi 



Im.i.n. Sin 



mm 0111 "mm 



□ 

To finish proving the path lemma, we will need to use the following fact: 
Lemma 3.7. Let u%, U2 be unit vectors whose angles differ by 9. Then for any v and a,b > 0: 

(ujv) 2 ^ (u^v) 2 ^ sin 2 6>|i>| 2 



a + b 



Proof. Let a be the angle between u\ and v. 

We must show that - cos 2 a + | cos 2 (a + 9) > sin 2 9 

Recall that we wish to prove 



□ 



^■max {A eo , At ) 



(as*) A eo x* 
(x*) T A r x* 



0(n 3 



Let us first bound the denominator (x*) T Aq-x* . 
(x*) T A r x* 

n+l 

( 



> In 



«l-«0 

|t>i— «o| 



i=2 
T 



( x * ^o) 



"Imir. 



\Vi - v \ 



n+l 



n+l 
i=2 



l"i-"a(i)l 



31 i s a(i)) 



K - ««(t)l 



+ 



V 



»1 - 



i=2 





lY^^y ^ 


2 




2 \ 
















- + " 






V 








/ 



by Lemma [3]6] and the fact that d\ = x\ — Xq is parallel to (vi — Vq) 

12 ^L 1 / s : n 2« . IJ.I2 



> In 



\di\ 



\Vi~Vq\ ^ \\Vi - V s ty\ + \Vi - 



by Lemma [37T1 



n+l 
i=l 



'Ymin sin Grain \ d% 



Next we bound the numerator: 



(x*) T A eo x* - 7(6o) 



by Lemma 13.61 



\v n +i ~ v \ 



\v n+1 - v \ 



, Imax * *|2 

— -j l^n+l ~~ x ol 

- 'Ymax ft * * * I i 

- I IFn+l ~~ X s{n+1) \ + l^sfn+l) ~ ^sfsfn+l)) I + 



^ Imax 



n+l 



I i 1 \ 'max X - ^ i i I 

n + l; V \di\ 

sin H„ • - ^— ' 



Imin \ Imin SU1 , , ,, 

\ i=l 



Combining the above, we get: 



\ (A A \ — ( X *) T Ae X* 
A-max V^-eo i ) 



(x*) T A r x* 



3 I Y^ rl ~'~l I rj 

, 2 (lmin\ lmax\^ i=1 1 i 'J , , 1A 2 

< — ; . - .„ (n + 1) 



sin 



^ @min \ I max J 7mm X^i=l 



- / ' /IJ/fl 1 I ma. r 

sill m in V ''max J ^fmin 



1 • V -v 

''mm \ I max i -. \3 



where the last inequality follows from Cauchy-Schwarz. 

4 Graph Embeddings 

Our remaining task is to describe how to map edges to supporting face sets with low congestion, 
as required for Lemma 11.61 We need some graph theoretic notions which will inform how we 
choose to support the edges. 

For a graph with edges E, we use V{E) to denote the power set of E. Thus V(E) includes 
all paths in the graph. 

Let us define the notion of embedding vertex pairs of a graph into paths in a subgraph: 

Definition 4.1. For an unweighted graph (V,E), a subgraph HOE, and a set Z of pairs of 
vertices in V , an embedding of Z onto H is a map tt : Z — > V(H), where tt(v,w) Q H is a 
path in H whose endpoints are v,w. 

The stretch of it is str(7r) = Ylzez I 71 "! 2 )!- 

The congestion of tt is cong(ir) = maxj- e # Ylzez-fe-K(z) M- 2 )! 

A particular example is embedding the edges of a graph onto a spanning tree: 

Definition 4.2. For an unweighted connected graph (V, E) and spanning tree T C E, let T(v, w) 
denote the path in T that connects v to w. 
The stretch of T is str(T) = Y.e&E l T ( e )l- 

We will make use of algorithms that take a graph and generate a spanning tree, augmented 
with a few additional edges, such that given vertex pairs have a low congestion embedding into 
the augmented tree. First, we need to create a low stretch-spanning tree. The best known result 
is from [EEST06j : 

Theorem 4.3. There exists an algorithm T = LowStretch(G), that takes a connected graph G = 
(V, E), runs in time 0(\E \ log 2 \E\), and outputs a spanning tree T with stretch 0(\E \ log 2 \ V\ log log 

We can then use the low-stretch spanning tree to create an augmented spanning tree with 
the desired low congestion embedding. This algorithm is given in Appendix [Bj 

Theorem 4.4. There exists an algorithm S = LowCongestAugment(G, T, Z, ip, k) that takes a 
planar graph G = (V, E), a spanning tree T of G, a set Z of pairs of vertices in V , an embedding 
ip : Z — > V{E), and an integer k. The algorithm runs in time 0(\E\ log \ V\ + cong(ip)\E\) and 
returns a set of edges S C E of size at most k, such that there exists an embedding tt : Z — > 
V(TUS) with congestion 0(^str(T)cong(ip)). 



TrussSolver(7", k) 

Let A be the stiffness matrix of T — (n, {vi}f =1 , E, 7), and let F be its face set. 

1. Define r : [n] — > F to map vertex i to an arbitrary face in the set F^ of faces containing i. 

2. Run R = LowStr etcher) i n time 0{n log 2 n). 

3. Define Z = {(r(i),r(j)) : G E}. 

Define an embedding if) : Z — > V(Qt), by defining ip(T(i),r(j)) to be an arbitrary path from 
r(i) to r(j) in Fi UFj. (We know Fj and Fj intersect because some face contains edge 

Run S = LowCongestAugment(Q7-, R, Z, ip, k) in time 0(n log n). 

4. Run (T, p) = ((V', E', i),p) = f retsaw(T, R U S, r). Let 5 be the truss matrix of T'. 

5. Use Lemma 13. II to find a Cholesky factorization B = PLL T P T in time 0(n + /c 3 / 2 ), such that 
L can be used to solve equations in B in time 0(n + klogk). 

6. Run preconditioned conjugate gradient using Bs, the Schur complement of B with respect 
to A, as the preconditioner. Use L to solve equations in Bs, by solving equations in B (see 
Lemma ll.5p . 

The relative error will be down to e after 0(y/ k(A, Bs) log -) iterations. 



Figure 3: The TrussSolver algorithm 



5 Solving the Linear System 

In Figure[3]we present the complete TrussSolver algorithm for solving linear systems in matrix 
a A that is the stiffness matrix of truss T = (n, {vi}f_^, E, 7) with face set F. The algorithm 
preconditions A using the stiffness matrix B of a fretsaw extension T = (m, E', 7') 

with face set F' . The algorithm uses a parameter k that will be chosen later. We will 
show that (with the right choice of k) the algorithm attains a relative error of e in time 
O (n 5 / 4 (log 2 n log logn) 3 / 4 log i). 

We want to use the Congestion Dilation Lemma (Lemma I1.6P to give an upper bound 

I A 

on k(A,Bs)- Recall that for A' = of the same size as B, Corollary 12.71 says that 

k(A, Bg) < 2X max (A', B). To bound \ max (A' , B), for each edge (p, q) E E we will give a face 
subset F' „ C F' connecting p to g, such that this embedding of edges to truss paths has low 
congestion. 

In particular, let E' p q C E' be the set of truss elements in the faces so that Tp q = 
(m, {v'j}, E' , 7') is the "subtruss" of T' comprising the faces F Ptq . Lemma 1531 states that 



so Lemma 11.61 yields 



^max(A(p ;q ),A Tpiq ) = 0(\F Ptq \ ) 



^max(A',B) < max V 0{\F' \ 

(p,q)eE:eeE^ q 



It remains for us to describe the truss paths T£ q that yield the desired bound. 

Recall that we have constructed a subgraph RUS £ Qt, for which the LowCongest Augment 
algorithm guarantees that there exists an embedding ir : Z — > V(R U S) of low congestion. Let 
us denote 7r Pi9 = 7r(r(p), r(q)), the path in R U S C Q-j- connecting r(p) to r(g). Map this path 
back into Q r > to get the path n' p q = p' 1 ^^) = p _1 (/2)) ■ ih, h) G 7r p ,J- We then 

form 71' from the set F' „ of faces in ir' . 

yiH f,y 

Let us first determine the congestion of ir more precisely. The algorithms LowCongest Augment 
and LowStretch guarantee respectively that cong(7r) = 0(^str(R)cong(ifj)) and str(R) = 
0(n log 2 n log log n) . 

As for cong(ip), we have 



< (max |^(«) 

zeZ: y eV>0) 



cong(t/j) = max 



max j{z G Z : (/ € "0(z)}| 



Now note that for alH, IFJ < tt 25 - = 0(1)- Since any ib( T i, T i) onr y contains triangles in Tj UT,-, 
\tp(T{i),r(j))\ = 0(1), and so max, eZ |^(z)| = 0(1). 

Similarly, say that q € Qt is a pair of faces sharing the edge Since i and j are the 

only vertices that the pair of faces have in common, q can only be in a path ip(r a ,T^) if one of 
a or (3 is i or j. So \{z : q e tp(z)}\ < \{{a,j3) £ E : a = i or a = j}\ < ^- = 0(1). 

Thus, cong(ip) = 0(1) and cong(ir) = 0(| log 2 nloglogn). 

We now have: 

k(A,B s ) < max £ 0{\F^) 



= O 



(p,q)€E:eeE^, 



( 



\ 



max > \F Da \ 

(p,q)eE: 
eeE' 



= O 



max 

(fiJj)eQr' 



"/'-'/ 

(p,9)€S: 
(fi,fj)€n' p , q 



O 



max 

(fi,fj)eRuS 



E 



7T. 



{fi,fj)€Tp,q 



o 



max 

(fi,fj)eRuS 



\ 



zeZ: 

(fi,fj)eiT(z) 



O (cong(Tr)Y 



n 3 „ 2 



= O ( ^3 (log 2 n log log n) 3 



Steps 1-4 take time 0(nlog 2 n + k 3 / 2 ). Each conjugate gradient iteration takes time 0(n + 
klogk), and the number of iterations is 

I n 3/2 J 

y/K(A, B s ) log - = 0(^(log 2 raloglogn) 3/2 )log- 
Thus, our total running time is: 

O log 2 n + k 3/2 + n 3/2 k~ 3/2 (log 2 n log log n) 3/2 (n + k log fc) log i 
For A; = n 5 / 6 (log 2 raloglogn) 1//2 this gives a running time of O (n 5 / 4 (log 2 n log log n) 3 / 4 log ^) . 
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A Preconditioning Lemmas 



We first prove several lemmas dealing with the Schur complement. Recall that the Schur com- 
1 '"" is Bs = B\\ — B12B22B12 



plement of B 



B 12 B22 



Lemma A.l. If B is positive semidefinite then for any x 



Proof. 

[x T y T }B 



mm I [x T y T ] B 



= x 1 Bsx 



= x T B u x + x T B 12 y + y T Bf 2 x + y T B 22 y 

= x T (B u - B l2 B 22 l Bl 2 )x + x T B 12 B 22 1 Bf 2 x + x T B 12 y + y T Bf 2 x + y T B 22 y 
= x T B s x + (y + B 22 l B^ 2 x) T B 22 (y + B 22 x B T X2 x) 



x T B$x + 



> x T B$x 



-1 T 



y + B 22 l B\ 2 x 



B 



y + B^B^x 



The last inequality holds because B is positive semidefinite, and it is an equality for y 



Lemma 11.41 For positive semidefinite A,B, X max (A' , B) = \ max (A, B$). 
Proof. 



^max(A' , B) 



max 
x ,y 



max 



max ■ 



[x T y T ] 


A' 


X 




y. 


[x T y T ] 


B 


X 




y 



x T Ax 



[x T y T ]B 
x 7 Ax 



x x T B$x 

^max(A, B,s) 



□ 



The third equality uses Lemma lA.ll 

implies B$x = b 



x 




b 


y. 








Lemma 11.51 B 

Proof. Multiplying by 



-Bi 2 B 22 
I 



gives: 



-Bv2,B 2 2 





B\2 




X 






B22 




y. 




~B S 


" 




X 




Bf 2 


B22_ 




y. 





5 12-D 2 2 





b 








□ 



□ 



Lemma [TT6] (Congestion-Dilation Lemma). Given the symmetric positive semidefinite matrices 
Ai, A n , Bi, B m and A = Yli-^i an d B = Ylj Bj and given sets C [1, ...,m] and real 
values Si that satisfy 

^max (Ai , ^ Bj) Si 

it holds that 



^max{A, B) < max j ^2 s i 



Proof. Let us write A =4 B to mean that B — A is positive semidefinite. We are given that 



So we have: 



Ai =4 s i ^ ~] Bj 



= E b j E s * 




max 



□ 



Lemma 12.61 (sec [STOGaJ Lemma 8.14). Let A,B be the stiffness matrices ofT,T' respectively. 
Let Bs be the Schur complement of B with respect to A. 

If T is a fretsaw extension ofT, then \ m i n (A, B$) > 1/2 

Proof. Suppose that T = (m, {vi}^l =1 , E' , 7'} is the (p, 7r)-fretsaw extension of T = (n, {vi}f =1 , E, 7}. 
Define M to be the 2n x 2m matrix that for all i G [n],j £ [m] satisfies 



~M 2l -i 


M 2i " 


-{: 


i = vr(j) 


Mjy_i 






otherwise 


J) G 




).*Cj)) = 


MA^ j^M T . However any element in T is 



part of at most two faces, and so can have at most two copies in T' . Thus, A =<! MBM T =<! 2A. 

Recalling that ir(i) = i, we note also that M takes the form [J Ml] for some (2n — 2m) x 2m 
matrix Mi. Thus, for any x, we have: 



2x T Ax > x T MBM T x 

= [x T M lX T ]B 

> x B$x 



X 

Mix 



where the last inequality holds by Lemma lA.ll 



□ 



B Augmented Spanning Tree 

We give a proof of Lemma 14.41 which we restate here for convenience. It is a generalization of 
an algorithm from [ST06b]. 

Lemma 14.41 There exists an algorithm S = LowCongestAugment(G, T, Z, ip, k) that takes a 
planar graph G = (V, E), a spanning tree T of G, a set Z of pairs of vertices in V , an embedding 
ip : Z — > V{E), and an integer k. The algorithm runs in time 0{\E\ log \V\ + cong(ip)\E\) and 
returns a set of edges S C E of size at most k, such that there exists an embedding ir : Z — > 
V(T U S) with congestion 0(j:str(T)cong(i{))). 

We need to make use of the following tree decomposition algorithm from [ST06b]: 

Theorem B.l ([ST06b] Theorem 8.3). There exists a linear-time algorithm 

((Wi, W c ), p) = decompose(T, E, n, k) 

that on input 

• a tree T on vertices V 

• a set E of edges forming a planar graph on V 

• a function rj : E — > Et + 

• a positive integer k < X^eG-E v( e ) 

outputs sets Wi, W c C V , where c < k, and a function p that maps each edge in E to either 
a set or pair of sets in {Wi, W c } such that: 

• V = U c i=1 W i; and for all i ^ j, \W { U Wj\ < 1 

• for all i, the graph induced by T on Wi (which we denote Ti) is connected 

• for each (u,v) G E, there are i,j (possibly equal) such that u £ Wi and v E Wj and 
p(u,v) = {W i ,W j ] 

• the graph ([c],{(i,j) : 3e G E s.t. p(e) = {Wi, Wj}}) is planar 

• for all Wi such that |Wj| > 1, 

e£E:Wi€p{e) e&E 

We can now prove the lemma. 
Proof. Here is the LowCongestAugment algorithm: 



S = LowCongestAugment((V, E),T, Z, tp, k) 

1. For each e G E, define 17(e) = £ z:eG ^) E e 'g^) l T ( e ')l 

2. Set ((^.....WcJiP) = decompose^, £,77, |_§J). 

3. Let Eij = {e G E : p(e) = {Wj, VFj}}. For all nonempty Eij define 

Si j = arg min |T(e)| 

and let 5" be the set of all the Sij. 



Let us analyze the running time. In step 1, we must compute 77(e) for all e G E. First we 
compute and record \T(e)\ for all e € E. [ST06b| gives a method to do this in time 0(|-B| log |V|). 
We can then compute each 77(e) by summing ^z-eev(z) — con 9(V0 of the \T(e)\ values. 

This gives a total time of 0(|-E| log \ V\ + cong(ip)\E\) for step 1, and the remaining steps clearly 
run faster than 0(\E\ log |^|). 

Let also note that \S\ < k. This follows from the fact that the graph ([c], '■ Eij ^ 0}) 

is planar and has \S\ edges. Since a planar graph on c < k/3 vertices cannot have more than 
3c — 6 < k edges, we have that |5| < k. 

Now let us demonstrate the existence of a low congestion embedding. For each (v, w) G E, 
let us define a path tt(v , w) in T U S from v to w, as follows: 

• If p(v,w) is a singleton {Wj}, then we simply define tt(v,w) = T(v,w) 
Note that ir(v,w) C T,,. 

• If p(v,w) is a pair {Wj,VFj}, then let (v',w') = Sij G S and define ir(v,w) = T(v,v') U 
{(v',w')}UT(w',w). 

Note that 

ir(v, w) C Tj U Tj U {sjj} 

and that 

|7t(u,«j)| < |T(u,u;)| + \T(v',w')\ + 1 < 3|T(u,io)| 
Furthermore for (u,io) G 5, define 7r(u,u;) = U eg ^( wil ,)7r(e). 

Fix an eo £ TU5. By construction of ir, e can be only in a path 7r(z) if it is either in S or 
in some subtree Tj.. 

With this in mind, define Iq such that Tj is the subtree containing e. (There is at most one 
such tree, but if there is none then choose iq arbitrarily.) Define i\,i2 such that if e 6 S then 
e = z h,i2- (If e 5 1 then choose i\,i2 arbitrarily). Then for any z G Z, if e G 7r(z) then at least 
one of the following must hold: 

• For some e G V'( 2 )) G p( e )- 

• For some e G ip(z), p{e) = {W^^W^}. (In particular, Wi x G p(e).) 
Thus we can bound the congestion of it on eo: 

2:e e7r(z) z:{ee0(z):Wi o e/o(e)}^0 z:{eGt/>(z): Gp(e)}^0 



We note that for any i: 



Thus: 



E ^ E E 

z:{e£*(>(z):Wi£p(e)}^(& e:Wi£p(e) z:e£ip(z) 

< E E E W)\ 

e:Wi€p(e) z:e£ip(z) e'€ip(z) 

^ 3 E EE mo 

e:VKiGp(e) z:e£ip(z) e'et/>(z) 
e:WiGp(e) 

< T E^) 



E kwi < f E^( e ) 

z:eoG7r(z) eSE 

SyEE E PV)I 

e£-B z:eetp(z) e'&ji{z) 

= f Ei T ( e, )i E 

e'e_B z:e'eip(z) 

24 x - 

< y E \ T ( e ')\ con 9W 

e'GE 

24 

= —str(T)cong(tp) 



Since the choice of eo was arbitrary, we have 



24 

cong(ir) < — str(T)cong(ip) 



□ 



